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® NANBV diagnostics and vaccines. 

® A new virus. Hepatitis C vims (HCV). which has proven to be the ma|or etiologic agent of 43lood-borne 
NANBH. was discovered by Applicant The initial work on this virus, which Includes a partial genomic sequence 
of the prototype HOV Isoiate. is described in EPO Pub, No, 318.216. and POT Pub. No. WO/89/04e69 The 
present invention, which in part is based on new HCV sequences and polypeptides which are not disclosed In 
the above-cited publications, includes «ie application of these new sequences and polypeptides In immunoas- 
says, probe diagnostics. anti-HCV antibody production, PCR technology, and recombinant DNA technology 
included within the Invention also are novel Immunogenic polypeptides encoded wltfiin clones containing HCV 
cDNA. novel metiiods for purifying an immuncgenlc HCV polypeptide, and antisense polynucleotides derived 
from HCV cONA. 
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conditions described supra, for the expression of the fusion polypeptide. C100-3. The resulting polypeptides 
are screened using the sera from individuals with NANBH. described supra, for the screening of Im- 
munogenic polypeptides encoded in HCV cDNAs expressed In E. coll. 

6 

Comparison of flro Hydrophobic Profiles of HCV Polyprotelns with West Nile Virus Poiypro tein and with Dengue 

Virus NS1 ^ 

10 The hydrophobidty profile of an HCV poiyprotein segment was compared with that of a typical 
Ravivirus, West Nile virus. The polypeptide sequence of the West Nile virus poiyprotein was deduced from 
the l<nown polynucleotide sequences encoding the non-sfructurai proteins of that virus. The HCV poiy- 
protein sequence was deduced from the sequence of overiapping cDNA clones. The profiles were 
determined using an antigen program which uses a window of 7 amino acid width (the amino add in 

18 question, and 3 residues on each side) to report the average hydrophobidty about a given amino acid 
residue. The parameters giving the reactive hydrophobidty for each amino acid residue are from Kyte and 
Ooolittle (1982). Rg. 19 shows the hydrophobic profiles of the two polyprotelns: the areas corresponding to 
the non-stnicturai proteins of West Nile virus, nsl through ns5, are Indicated In the figure. As seen in the 
figure, there is a general similarity In the profiles of the HCV poiyprotein and the West Nile virus 

20 poiyprotein. 

The sequence of the amino adds encoded In the s'-reglon of HCV cDNA shown in Fig. 16 has been 
compared wHh the corresponding region of one of the sfralns of Dengue virus, described supra with 
respect to the profile of regions of hydrophobidty and hydrophilidty (data not shown). This comparison 
Indicated that the polypeptides from HCV and Dengue encoded in this region, which corresponds to the 
25 region encoding NSI (or a portion thereoO. have a similar hydrophoblc/hydrophlllc profile. 

The similarity in hydrophobidty profiles. In combination with the previously Identified homologies In the 
amino add sequences of HCV and Dengue HavMros in EP 0.218.316 suggests that HCV is related to these 
members of the Ravivirus family. 
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Characterization of the Putative Polypeptides Encoded Within the HCV ORF 

unJ^LT^ °^ *® ^leduced from the overlapping 

HCV CDNAs m the various dones described In EPO Pub. No. 318.216 and those described supra It may be 
deduced from the sequence that the HCV genome contains primarily one long continuous OFIF wtiidi 
encodes a poiyprotein. In the sequence, nudeofide number 1 con^sponds to the first nudeoUde of ttie 
Initiator MET codon; minus numbers indicate that the nucleotides are that distance away in the s'-direction 
(upstream), while positive numbers indicate that the nucleotides are that distaiice away in the 3'-direcflon 
(downstream). The composite sequence shows the "sense" strand of the HCV cDNA. 

The amino acid sequence of the putative HCV poiyprotein deduced from the HCV cDNA sense strand 
sequence Is also shown In Rg. 17. where position 1 begins with the putative Initiator methionine 

Possible protein domains of the encoded HCV poiyprotein. as well as the approximate boundaries, are 
*«e following (the polypeptides identified within the parentheses are those whldi are encoded In the 
Ravivirus domain): 



4S 



so 



EP 0 388 232 A1 



Putative Domain 


Approximate 




Boundary 




(amino acid 




nosj 


"C" (nucleocapsid protein) 


1-120 


"E" (Virion envelope protein(8) and possibly matrix (M) proteins 


120-400 


"NS1 " (complement fixation antigen?) 


400-660 


"NS2" (unknown function) 


660-1050 


"NS3" (protease?) 


1050-1640 


"NS4'* (unloiown function) 


1640-2000 


"NS5" (polymerase) 


2000-? end 



It should bo noted, however, that hydrophobicity profiles (described infra). Indicate that HCV diverges 
from the Ravivlrus model, particulariy with respect to the region upstream of NS2. IVIoreover, the 
boundaries indicated are not intended to show firm demarcations between the putative polypeptides. 



The Hydrophilic and Antigenic Profile of the Polypeptide 

Profiles of the hydrophillcity/hydrophobiclty and the antigenic Index of the putative polyprotein encoded 
in the HCV cDNA sequence shown in Rg, 16 were detemnined by computer analysis. The program for 
hydrophillcity/hydrophobicity was as described supra The antigenic index results from a computer program 
which relies on the following criteria: 1) surface probability. 2) prediction of alpha-helicity by two different 
methods; 3) prediction of beta-sheet regions by two different methods; 4) prediction of U-turns by two 
different methods; 5) hydrophllidty/hydrophobldty; and flexibility. The traces of the profiles generated by 
the computer analyses are shown in Rg. 20. In the hydrophilicity profile, deflection above the abscissa 
Indicates hydrophilicity. and below the absdssa indicates hydrophobicity. The probability that a polypeptide 
region is antigenic is usually considered to increase when there is a deflection upward from the abscissa in 
the hydrophilic and/or antigenic profile. It should be noted, however, that these profiles are not necessarily 
Indicators of the strength of the immunogenicity of a polypeptide. 



Identification of Co-linear Peptides in HCV and Flavlviruses 

The amino acid sequence of the putative polyprotein encoded In the HCV cONA sense strand was 
compared with tfie known amino acid sequences of several members of Fiaviviruses. The comparison 
shows that homology is slight, but due to tiie regions in which H Is found. It is probably significant The 
conserved collnear regions are shown in Fig. 21. The amino acid numbers listed below the sequences 
represent the number in the putath^e HCV polyprotein (See Rg. 17.) 

The spacing of these conserved motifs Is similar between ttie Fiaviviruses and HCV, and Implies tiiat 
there is some similarity between HCV and tiiese flaviviral agents. 

The following listed materials are on deposit under the Xems of the Budapest Treaty with the American 
Type Culture Collection (ATCC). 12301 Partdawn. Dr.. Rockviile. Maryland 20852. and have been assigned 
the following Accession Numbers. 
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